Fix img2img DDIM index out of bound #1137

wfng92 · 2022-10-17T09:02:41Z

Added a community solution to fix index out of bound when doing img2img generation with ddim sampler. Also, restored steps_out to be ddim_timesteps + 1 since the removal was meant to fix the 1000 steps issue

Added a [community solution](CompVis/stable-diffusion#111 (comment)) to fix index out of bound when doing img2img generation with `ddim` sampler. Also, restored `steps_out` to be `ddim_timesteps + 1` since the removal was meant to fix the [1000 steps issue](CompVis/stable-diffusion#111)

lstein · 2022-10-17T15:54:13Z

Thanks for this. Is there a test case that you use to test the bug before and after the fix? It is certain values of step, is that right?

wfng92 · 2022-10-18T01:57:51Z

Thanks for this. Is there a test case that you use to test the bug before and after the fix? It is certain values of step, is that right?

After merging version 2.0 to my local machine, I encountered the following error frequently during img2img generation:

../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:145: operator(): block: [0,0,0], thread: [0,0,0] Assertion `idx_dim >= 0 && idx_dim < index_size && "index out of bounds"` failed.

Traceback (most recent call last):
  File "/stable-diffusion/./ldm/generate.py", line 388, in prompt2image
    results = generator.generate(
  File "/stable-diffusion/./ldm/invoke/generator/base.py", line 78, in generate
    image = make_image(x_T)
  File "/stable-diffusion/./ldm/invoke/generator/img2img.py", line 45, in make_image
    samples = sampler.decode(
  File "/usr/local/lib/python3.9/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/stable-diffusion/./ldm/models/diffusion/sampler.py", line 347, in decode
    outs = self.p_sample(
  File "/usr/local/lib/python3.9/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/stable-diffusion/./ldm/models/diffusion/ddim.py", line 79, in p_sample
    sigma_t = torch.full((b, 1, 1, 1), sigmas[index], device=device)
RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

Image generation no longer work after that and restart is required. The common configuration that I used at that time is as follows:

sampler_name: DDIM
steps: 12
cfg_scale: 0.65 (I used other values as well between 0.01 to 0.95)

After applying the fix, I no longer receive the error above. Please note that the value of ddim_timesteps is not the same.

# Step 12 (before)
[  0  83 166 249 332 415 498 581 664 747 830 913 996]
# Step 12 (after)
[  0  83 166 249 332 415 498 581 664 747 830 913]

# Step 18 (before)
[  0  55 110 165 220 275 330 385 440 495 550 605 660 715 770 825 880 935 990]
# Step 18 (after)
[  0  55 110 165 220 275 330 385 440 495 550 605 660 715 770 825 880 935]

# Step 24 (before)
[  0  41  82 123 164 205 246 287 328 369 410 451 492 533 574 615 656 697 738 779 820 861 902 943 984]
# Step 24 (after)
[  0  41  82 123 164 205 246 287 328 369 410 451 492 533 574 615 656 697 738 779 820 861 902 943]

# Step 27 (before)
[  0  37  74 111 148 185 222 259 296 333 370 407 444 481 518 555 592 629 666 703 740 777 814 851 888 925 962 999]
# Step 27 (after)
[  0  37  74 111 148 185 222 259 296 333 370 407 444 481 518 555 592 629 666 703 740 777 814 851 888 925 962]

It seems that the error only happens when num_ddim_timesteps % 3 == 0 (step is multiple of 3). For example, using the step 27 will result in out of bound when using the original steps_out = ddim_timesteps + 1 code. This fix will resolve both issue as it excludes the extra time step in the bad cases.

lstein · 2022-10-18T02:31:54Z

That would explain why I never see the error. I tend to use steps of 50, 20 and 10 during testing.

I'll go ahead and approve.

lstein

LGTM

lstein approved these changes Oct 18, 2022

View reviewed changes

lstein merged commit 1c2bd27 into invoke-ai:development Oct 18, 2022

wfng92 deleted the patch-2 branch October 21, 2022 05:11

This was referenced Oct 27, 2022

Images have slightly changed (txt2img) #1254

Closed

Restore reproducibility when steps is not multiple of 3 #1356

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix img2img DDIM index out of bound #1137

Fix img2img DDIM index out of bound #1137

Uh oh!

wfng92 commented Oct 17, 2022 •

edited

Loading

Uh oh!

lstein commented Oct 17, 2022

Uh oh!

wfng92 commented Oct 18, 2022 •

edited

Loading

Uh oh!

lstein commented Oct 18, 2022

Uh oh!

lstein left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix img2img DDIM index out of bound #1137

Fix img2img DDIM index out of bound #1137

Uh oh!

Conversation

wfng92 commented Oct 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lstein commented Oct 17, 2022

Uh oh!

wfng92 commented Oct 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lstein commented Oct 18, 2022

Uh oh!

lstein left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

wfng92 commented Oct 17, 2022 •

edited

Loading

wfng92 commented Oct 18, 2022 •

edited

Loading